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ABSTRACT 



A self-repair method for a random access memory (RAM) 
array comprises writing a value to the memory array, reading 
a value from the memory array and comparing the read and 
write values to identify faulty memory cells in the memory 
array. An address of a newly-discovered faulty memory cell 
is compared to at least one address of at least one previously- 
discovered faulty memory cell. The address of the newly 
discovered faulty memory cell is stored if a column or row 
address of the newly-discovered faulty cell does not match 
any column or row address, respectively, of a previously- 
discovered faulty memory cell. Flags are set to indicate that 
a spare row or a spare column must replace the row or 
column, respectively, identified by the address of the 
previously-discovered faulty memory cell, if the row or 
column address of the newly-discovered memory cell 
matches the respective row or column address of the 
previously-discovered faulty memory cell. Spare rows and 
coliunns that have been indicated by the flags as requiring 
replacement are allocated to replace faulty rows and col- 
umns respectively. The remaining spare rows and columns 
whose row and column addresses respectively have been 
stored are then allocated. Furthermore, the RAM is flagged 
as unrepairable if an insufGcient number of spare rows or 
spare cells remain to replace all of the rows or columns 
containing faulty cells. 

29 Claims, 6 Drawing Sheets 
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EMBEDDED RAM WITH SELF-TEST AND 
SELF-REPAIR WITH SPARE ROWS AND 
COLUMNS 

BACKGROUND OF THE INVENTION 

Modem microprocessors employ large on-chip random 
access memories (RAMs) in a variety of ways to enhance 
performance. These RAMs are typically static (SRAMs) due 
to associated speed advantages. The most common usage is 
in the form of on-chip caches. In many instances, such 
RAMs constitute the majority of transistors consumed on 
chip and are the largest occupants of chip area. 

Embedded RAMs give rise to two particular problems 
during chip manufacturing. Because an embedded RAM 
occupies a significant portion of a chip's area, the probabil- 
ity that a defect lies within the RAM is relatively high. The 
RAM thus becomes a controlling factor in chip yield. 
Second, the embedding of RAM not only makes its own 
testing difficult, but also impairs testability of all other 
functions on chip, such as the core logic. For example, much 
of the testing of other functions requires the use of the 
embedded RAM, which must be functioning properly. 

Traditionally, semiconductor manufacturers have tackled 
RAM yield problems by incorporating a repair scheme with 
redundant rows and/or columns. For embedded RAM, 
however, this compounds the testing problems because a 
diagnosis to identify defects and the repair of those defects 
are required before the testing of core logic, for example, can 
begin. 

Recently, Built-in Self -Test (BiST) and Built-in Self- 
Repair (BiSR) have been proposed as potential solutions to 
both of the above problems. The scheme presented by Koike 
et al, "A 30 ns 64 Mb DRAM with Built-in Self-Test and 
Repair Function", Int'l Solid State Circuits Conf., pp 
150-151, February 1992, self-repairs only field failures. It 
employs traditional row-column repair to remove manufac- 
turing defects. In the traditional method, a specialized RAM 
tester tests integrated circuit RAMs, and gathers and ana- 
lyzes failure information, while the repair is done by blow- 
ing fuses. Koike's on-chip scheme employs an on-chip 
microprogram read-only-memory (ROM) BiST scheme and 
self-repair logic block with a spare memory block. 

The scheme presented by Chen and Sunada, "Design of a 
Self-testing and Self -repairing Structure for Highly Hierar- 
chical Ultra Large Capacity Memory Chips," IEEE Trans, 
On VLSI Systems, pp. 88-97, Vol. 1, No. 2, June 1993, 
employs an on-chip RISC processor to collect and analyze 
a full failure bitmap to derive a repair solution. Besides the 
complexities of the RISC processor, the method also 
requires that a large enough block of the RAM under test 
must be available and fault-free to store a failure bitmap. 

The BiST/BiSR schemes presented by Trueuer and 
Agarwal, "Buill-in Self-Diagnosis for Repairable Embedded 
RAMs," IEEE Design and Test of Computers^ pp. 24-33, 
June 1993, and Bhavsar and Edmondson, "Testability Strat- 
egy of the Alpha AXP 21164 Microprocessor," Int'l Test 
Conference, October 1994, are limited to self-repair with 
only spare rows. 

SUMMARY OF THE INVENTION 

The problem with the previous approaches here has been 
high complexity, e.g., the RISC processor of Chen and 
Sunada, or limited repair capability, e.g., using only spare 
rows or only spare columns. The present invention concerns 
a method for the self-repair of a RAM with both a spare row 
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and a spare column using a significantly simpler built-in 
self-test/built-in self-repair (BiST/BiSR) logic. The scheme 
affords a greater flexibility in spare resource allocation and 
therefore can result in higher yield while utilizing simplified 
5 self-test/self -repair logic. 

In accordance with a preferred embodiment of the present 
invention, a self-repair method for a random access memory 
(RAM) array comprises: 
writing a value to the memory array; 
reading a value firom the memory array and comparing the 
read and write values to identify faulty memory cells in 
the memory array; 
comparing an address of a newly-discovered faulty 
memory cell to at least one address of at least one 
previously-discovered faulty memory cell; 
storing the address of the newly discovered faulty 
memory cell, if a column or row address of the newly- 
discovered faulty cell does not match any column or 
20 row address, respectively, of a previously-discovered 
faulty memory cell; and 
setting flags to indicate that a spare row or a spare column 
must replace the row or column, respectively, identified 
by the address of the previously-discovered faulty 
25 memory cell, based on a present state of the flags and 
whether the row and/or column address of the newly- 
discovered memory cell matches the respective row 
and/or column address of one or more previously- 
discovered faulty memory celU. 
30 The present state of the flags includes the present state of 
row and column "must'* flags, and entry valid flags. 
Preferably, spare rows and column slices are allocated to 
replace faulty rows and columns respectively as indicated by 
the flags. Any remaining spare rows and columns, the 
35 addresses of which have been stored, are then allocated. 
Furthermore, the RAM is flagged as unrepairable if an 
insufiScient number of spare rows or columns remain to 
replace aU of the rows or columns containing faulty cells. 
The preferred embodiment comprises a plurality of 
40 memory blocks, each block comprising a plurality of 
memory cells organized into rows and N-bit wide column 
slices, where N is 1 or greater. Each slice represents one bit 
of the addressed word. A portion of the address is used to 
select a row. Another portion is used to select a particular 
45 column within each column slice. A spare row and column 
slice are available to replace a row or column slice having 
a faulty cell. In storing spare column shce allocation 
information, it is preferable to use a unique column slice 
address, or identifier, to identify the replaced column slice. 
50 Note that, while in the preferred embodiment a spare column 
slice, rather than an individual spare column, is available to 
replace an entire column slice having a faulty cell, the 
concept is the same as for individual column replacement, 
and the present invention applies equally well to such a 
55 system. In this case, the terms column slice address and 
column address are synonymous. In general, however, we 
use column, or column slice address (or identifier) to iden- 
tify the group (individual, complete slice, or otherwise) of 
columns being replaced. 
60 A test and repair logic circuit locates faulty cells within 
the memory array. A failure bitmap has at least two entries, 
where each entry comprises a row address field and a 
column address field for storing the row and column slice 
address of a faulty cell detected by the test and repair logic 
65 circuit. In addition, each entry comprises a flag for indicat- 
ing that the row indicated in the row address field must be 
replaced with a spare row. Similarly, each entry comprises a 
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flag for indicating that the column indicated in the column 
address field must be replaced with a spare column. In a 
2-entry failure bitmap, this signifies that at least two faulty 
cells arc present in the indicated row or column. A spare 
allocation logic circuit uses the failure bitmap's contents to 
decide how to allocate the spare rows and spare colunuis. 

A segment allocation map comprising row-column repair 
registers and logic is maintained in which spare row and 
column allocation information generated after the self-test 
by the failure bitmap and spare allocation logic is stored. 
Alternately, if allocation information has previously been 
programmed into an array of fiise links during manufacture, 
the allocation information is read from the fuse array, and 
the segment allocation map coofigured accordingly. 

Finally, the preferred embodiment, further comprises a 
column identifier encoder for encoding column identifiers of 
columns having faulty cells. A storage means is provided for 
storing encoded column identifiers and row identifiers. Such 
storage means includes, but is not limited to, a fuse array, a 
failure bitmap, or off-chip storage, such as may exist within 
a test system. Such storage means is also intended to 
encompass identifiers that are not physically stored but are 
determined, possibly on the fly, by the lest system and 
down-loaded to the chip. 

The repair engine of the preferred embodiment comprises 
a decoder for decoding an encoded identifier. The decoder 
itself comprises a counter which cycles through at least all 
of the allowed encoded identifier values. A comparator 
compares the counter's value with a stored encoded identi- 
fier. The comparator's output indicates whether there is a 
match. A validity check circuit determines whether the 
counter's value is a valid encoded identifier, and provides an 
indication of the validity. When an encoded identifier is 
valid, the comparator output is sent to the spare allocation 
logic circuit, preferably by shifting it out over a serial line. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and other objects, features and advantages 
of the invention will be apparent from the following more 
particular description of preferred embodiments of the 
invention, as illustrated in the accompanying drawings in 
which like reference characters refer to the same parts 
throughout the different views. The drawings are not nec- 
essarily to scale, emphasis instead being placed upon illus- 
trating the principles of the invention. 

FIG. 1 is a block diagram illustrating a sample silicon 
wafer with multiple chips, each chip having a RAM with 
multiple arrays. 

FIG. 2 is a block diagram of the repair architecture of the 
preferred embodiment of the present invention. 

FIG. 3 is a flowchart corresponding to procedure executed 
by the BiST/BiSR Engine (BEE) of FIG. 2. 

FIG. 4 is a block diagram illustrating a segment of an 
array from FIG. 1 according to a preferred embodiment of 
the present invention. 

FIG. 5 is a block diagram of the Failure Bitmap and 
Analysis Logic of FIG. 2. 

FIG. 6 is a flowchart of the test process of the present 
invention. 

FIG. 7 is a schematic diagram of the Repair Engine of 
FIG. 2. 

FIG. 8 is a schematic diagram of the decoder of FIG. 7. 

DETAILED DESCRIPTION OF THE 
INVENTION 

FIG. 1 illustrates a typical silicon wafer 10 during fabri- 
cation and before being cut up into individual integrated 
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circuit chips. An array 11 of chips 12 is created through a 
sequence of semiconductor processing steps. The bottom 
half of FIG. 1 illustrates how different areas of a single chip 
12 are devoted to certain functions. For example, in the 
5 preferred embodiment, each chip 12 comprises some core 
logic 16, which may contain, for example, arithmetic or 
floating point logic units and registers. In addition, each chip 
comprises an on-chip RAM cache 80. The RAM 80 is 
physically arranged into an array of identical memory seg- 
ments 251. The core logic 16 accesses the RAM 80 via a 
memory bus 20 which carries address, data and control 
signals. 

According to the invention, each RAM segment 251 is 
independently repairable. Logic 201 for performing a self- 
test and self-repair algorithm also resides on the chip 12, and 
communicates with the RAM 80 via the memory bus 20. A 
separate RepairData signal 229 transfers data that tells the 
RAM 80 how to repair itself. This repair information is 
derived either from a self-test or from a remote fuse array 
211 on the chip, or may be loaded in from an external source. 

Before the wafer 10 is cut up, each chip 12 is tested for 
functionality. A series of finger probes 13 descend upon each 
chip in tura, making contact with pads 14 which connect to 
the input/output drivers of the chip 12. Through these 
2^ connections, an external system 15 applies various signals, 
thoroughly testing the functionality of the chips. 

A particular benefit of the present invention fies in the 
time savings during manufacture. Typically, during testing, 
before a wafer is cut into individual chips, the probe 13 
descends on each chip 12 to test it. The test system 15 
applies test stimuli via wafer probe 13, and generates a fault 
bitmap for each chip, which is off-loaded to the test system 
15. The wafer probe is removed and the RAM 80 is repaired 
by selectively laser-zapping or cutting fuses 211. The wafer 
2j probe is again placed on the chip which is tested again to 
verify the laser-zapping. After the memory 80 is verified, the 
remainder of the chip's logic 16, which includes logic, is 
tested. 

With the present invention, the laser repair step may be 
postponed because the array is able to self-repair. Thus the 
core logic 16 is tested and if a particular chip has been found 
to be bad, the extra steps of removing the probe, laser 
zapping and replacing the probe for a final test, have been 
obviated. In an alternate embodiment, no fuses are blown, 

45 saving the extra steps in every case. In this case, the RAM 
self-repairs upon each power up. 

FIG. 2 shows the test and repair logic circuit 201, an 
embedded target RAM segment 251 and the remote fuse 
array 211. The control logic 201 includes the BiST/BiSR 

50 Engine (BBE) 203. the Repair Engine 205, the RAM Test 
Algorithm Engine 207 and the Failure CAM Array 209. The 
embedded target RAM segment 251 includes a RAM array 
81, associated Address and RcadAVrite Datapath Logic 255 
and the Row-Column Repair Registers and Logic 257. 

55 The BBE 203 controls the sequence of self-test and 
self-repair steps for each RAM segment 251. The RAM Test 
Algorithm Machine 207 generates an address, data and 
read/write control signal 221 sequence according to some 
RAM testing algorithm, such as the well-known MARCH 

60 algorithm. 

The BiST/BiSR Engine (BBE) 203 is a finite state 
machine that schedules the testing and repairing activities 
for the segments 251 in the RAM 80. It also allows a test 
system to extract repair information during manufacturing 
65 test. The BBE 203 begins its testing sequence at power up 
when the DoPwrBB signal 241 is asserted, or during manu- 
facturing testing when the DoMfgBB signal 242 is asserted. 
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The Address and Read/Write Datapath Logic 255 passes The BiST-2 and the Pause-2 steps are identical to the 

the data, address and read/write control signals 221 gcner- BiST-1 and the Pause-1 steps. The Pre-test and Post-test flow 

atcd by the RAM Test Algorithm Engine 207 on to the is repeated for every segment. Thus, after execution of the 

addressed RAM array 81. This logic 255 also compares the Post-test, branch 601 is taken if there are additional seg- 
data read from the RAM anray 81 with the expected data and 5 ments to be processed, while branch 603 is taken when 

upon an eiTor a^rts an error signal 22^ complete, i.e., when all segments have been 

address 225 of a detected bad column slice. The address 225 processed. 

may be encoded. If more than one column slice is faulty for ^ c - . , , 1 *u nr^r^ _ .1. j 

a given address, the multibad signal 227 is asserted. A bad . ^^^Z'^^ ^^^1 BBE asserts the Bbedone 

row address is provided (at 208) by the RAM Test Algorithm ^1^"^ If, when the test is complete and provides a pass/fail 

Engine 107. signal 244 to mdicate whether RAM is repairable. 

The Failure Bitmap and Analysis Logic 209 receives and ^ illustrates the Address and ReadAVrite Datapath 

analyzes in real time the error 223 and multibad 227 signals, Logic 255, the RAM array 81, and the Row-Column Repair 

and the failing row and column addresses badColumnId 225 Registers and Logic 257 within a RAM segment 251. An 

and badRowAddr 208, and maintains up-to-date failure address94, which is part of the data/address lines 20 of HG. 

bitmap information. This information is the basis of the 2, is logically divided into three portions 94A-94C. One 

self-repair, and is ofif-loaded to the test system via the portion 94C is used to select a particular RAM segment 251; 

on-chip pads 14 in one embodiment. 94A selects the row and 94B selects the memory cell column 

With the self -repair information now available, the extcr- within each slice, 

nal system programs the Remote Fuse Array 211 by burning Without loss of generality, RAM array 81 is shown 

or "blowing" selected fuses with a laser beam, thereby comprising eighty-three column slices 91, shown as Cblumn 

permanently storing the addresses of rows and column slices Slice(0)-Column SIice(82), plus a spare column slice 92, 

needing replacement. which are all accessed in parallel. Each slice 91, 92 com- 

This self-repair information is also presented to the Repair prises 128 rows which are addressed by a 7-bit portion 94A 
Engine 205 via 231, The Repair Engine 205 obtains the row of the address. Decoder 95 decodes this address portion 94 A 
and column repair data from either the Failure Bitmap and to select a particular row. A spare row 90 is also provided. 
Analysis Logic 209 or from the Remote Fuse Array 211, and The row repair register and logic 304 within the Row- 
serially sends the same to the Row-Column Repair Column Repair Registers and Logic 257 maps the spare row 
Registers/Logic 257 in the RAM segment 251 via line 229. 90 to replace an identified faulty row, while the column 
The Row-Column Repair Registers/Logic 257 comprises a repair register and logic 302 maps the spare column slice 92 
dynamic allocation map and is responsible for replacing the to replace an identified faulty column slice, 
faulty row and column with the spares. Each column slice 91 is eight bits wide. For each slice, all 

FIG. 3 is a flow diagram illustrating the operation of the eight bits of an addressed row are brought out of the slice, 

BBE 203. The BBE remains in the Idle state until one of the to a multiplexor 96, which selects the correct column using 
DoPwrBB or DoMfgBB signals is asserted. It then performs 35 the column address 94B portion of the address 94. Thus, 

two nearly identical sequences of steps, labeled Pre-Test and each non-spare RAM slice 91 directly corresponds to a bit 

Post -Test, on each segment 81 in RAM. The Repair-1 step of data in the host microprocessor's word width, which, in 

invokes the Repair Engine 205 (HG. 2) and initializes the the preferred embodiment, is 83 bits. 

Row-Column Repair Registers and Logic 257 (FIG. 2) of the During a test cycle, the RAM Test Algorithm Engine 207 

selected RAM segment 81. writes to and reads from target cells. In the read operation, 

Hie BiST-1 step invokes the RAM Test Algorithm Engine each data output bit 97 from the multiplexors 96 passes 

207 which performs the self-test on the selected segment by through the Colunm Repair Register and Logic 302, which 

alternately writing and reading Os and Is to the memory cells may map the spare column slice 92 if mapping information 

within the RAM segment 81, according to a memory test is available. The read mapped data bits are then compared by 
algorithm, such as the MARCH algorithm. Simultaneously, 45 XOR gates 98 with expected reference data fed to Do-Dgj 

the Failure Bitmap and Analysis Logic 209 is updated as respectively. If the output value 97 is not the same as the 

errors are detected. expected written value D0-D82, an XOR gate outputs a 1. 

The Pause-1 step halls the BiST/BiSR flow. During manu- The XOR 98 outputs are then brought to the column ID 

facmring test, which is initiated with a doMfgBB signal 235 encoder circuit 100, which, if only one slice is detected as 
(FIG. 2), the test system 15 off-loads the contents of the 50 faulty, provides the column slice address, or identifier, of the 

Failure Bitmap and Analysis Logic 209 by issuing a doF- faulty slice on the badColumnID signal 225. On the other 

caShift command 245. If required at this lime, the contents hand, if more than one slice is faulty, the multibad signal 227 

of the Failure Bitmap and Analysis Logic 209 are also is asserted. In either case, the error signal 223 is asserted, 

overwritten. When finished, the test system reissues the The output data 97 is available at 221A, which connects to 
doMfgBB command 242 to resume the BiST/BiSR flow. 55 ^® normal chip logic. The Column Repair Register and 

During normal power-on automatic testing, which is initi- Logic 302 also maps input data to the spare column slice 92 

ated with the DoPwrBB command 241 (FIG. 2), the flow if necessary. 

automatically resumes after waiting exactly one cycle in the FIG. 5 shows the Failure Bitmap and Analysis Logic 209, 

Pause-1 stale. comprising the Failure Bitmap CAM (content addressable 
Like the Repair-l step, the Repair-2 step invokes the 60 memory) array 110, Bitmap Update Logic 112 and the Spare 

Repair Engine 205 and, in addition, loads the RAM seg- Allocation Logic 114. 

ment's repair registers either from the repair information In the preferred embodiment, the Failure Bitmap CAM 

stored in the Remote Fuse Array 211 if the permanent repair array 110 has two entries, entryO and entryl. Of course, an 

has already been programmed into the array by the test embodiment having more than one spare row and one spare 
system 15. Otherwise, the loaded repair information is based 65 column would require additional CAM entries. Fields 

on the repair information from the Failure Bitmap and labeled RxAddress and CxAddress, where x is the entry 

Analysis Logic 209. number (0 or 1), hold row and column addresses of up to two 
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distinct faulty cells, where distinct cells are those which do For a given set of values for Vx, Rxmust, Cxmust, Rxhit, 

not have common row or column addresses. Each entry also Cxhit and Multibad, Table 1 gives the next values for VI, 

has three additional flags associated with it: a valid flag Vx, Rxmust, Cxmust and the unrepairable flag. Updatex signals 

a row-must flag (Rxmust) and a column-must flag (Cxmust). are generated to command the CAM array 110 to store the 

The valid flag Vx simply indicates that entryx holds valid 5 row and column addresses of a faulty bit into the indicated 

data. The Rxmust and Cxmust flags signify that the spare entry, or to retain the old addresses. VO will always be set 

row or column must be substituted for the row or column, after the first faulty cell is deteOed and is therefore not 

respectively, identified in the corresponding RxAddress or ^ ^Ute side (126) of Table 1. 

CxAddress field. In addition, there is an unrepairable flag For example, in the first line of Table 1, both valid flags 

116 to indicate that the RAM array is not repairable. lO 0, i.e., neither entry is valid, ROhit is 0, meaning that the 

Inapreferredembodiment.thecontentsoftheCAMarray ^^^^^y ^JJ'^ f,^^'^ ^^^^J""} "^f?^ ^"'^^ 

are shifted and overwritten serially via serial input and f^"^^* ^ ^ result the updateO signal is asserted and the 

output lines 122 when a shiftFCA command issued from a ^^fy row and column address are stored m entryO, 

controller. "^^"^ ^^'^ ^® ^ 

The Bitmap Update Logic 112 takes as input the Multibad 15 _ J° ^tep 518, VI and the Must flags are set according to the 

signal 227 as well as a collection of signals 124 from the T^^le 1 results from the Bitmap Update Logic 112. Row and 

Failure Bitmap CAM Array 110, described in detail in the addresses of the faulty cell are stored in an entry if 

discussion of FIG. 6 below. Whenever a faulty cell is Ae corresponding updatex signal is asserted If t^^ 

detected, the Bitmap Update Logic applies the logic of Table " ^e unrepairab e by the Bitmap Update Logic U2. 

1 to these incoming signals 227, 124, and returns signals 126 on ^« Unrepairable flag 116 is set. In a preferred embodiment, 

back to the Failure Bitmap CAM Array 110, which control ^sUng continues even when the RAM is found to be 

the next state of the Failure Bitmap CAM Anay 110. unrepauable. Thus, a tester cycle is completely deter^^ 

When a test has completed, the Must flags (Rxmust and independent of the number of defects found. 

Cxmust) are read by the Spare Allocation Logic 114 on lines The RAM test algorithm continues to test and process 

118. In addition, the row and column addresses stored in the . - errors until the test is complete. When an entire segment has 

Failure Bitmap CAM Array 110 are read on lines 119. The been tested, the algorithm is complete. The Spare Allocation 

Sparc Allocation Logic 114 applies the logic of Table 2 to Logic 114 takes the valid and must flags 118, and the stored 

decide how to allocate the spares, and provides the necessary and column addresses 119 from the Failure Bitmap 

information on lines 231. The useSpareRow signal indicates CAM Array 110, and applies the logic of Table 2 to decide 

that the spare row should be substituted for the row whose how to aUocatc the spare row and column slice, indicating 

address is indicated by RowAddress. Similarly, the useS- the decision via output signals 231 (FIG. 5). The repair 

pareColumn flag indicates that the spare column slice should engine 205 takes the information and transfers it to the repair 

be substituted for the column slice whose address is indi- registers 302, 304 located in the RAM segment. The process 

cated by ColumnAddress. 500 repeats for each segment. 

FIG. 6 is a flowchart 500 of the test process of the present FIG. 7 shows a preferred embodiment in which the Repair 

invention, which takes place, for each segment, during the 35 Engine (RPE) 205 comprises a multiplexor 300 for selecting 

BiSTl and BiST2 steps shown in FIG. 3, and is described in faulty row and column address information from either the 

conjunction with the previous figures. In step 501, a RAM failure bitmap and analysis logic 209 or from the fuse array 

test algorithm is continually executed until completion. A 211. 

faulty cell is detected when data read from the cell does not The RPE receives faulty row and column address infor- 

match the previously written data. 40 mation 130 from the Failure Bitmap and Analysis Logic 

Upon such a fault deteaion, the error signal 223 (FIGS. (FBM) 209 at one of the two inputs to the RPE multiplexor 

2, 4, 5) is asserted and error processing, shown within 300. This information 231 is the direct result of the decisions 

dashed box 504, takes place. This error processing 504 made according to Table 2 from a self-test executed in the 

occurs concurrently with the RAM test 501, as indicated by BiST-1 or BiST-2 steps of FIG. 3. 

dashed line 503. 45 The fuse array 211 comprises a matrix 210 of fuses. Each 

At step 505, the row and column addresses of the faulty line is dedicated to a memory segment 81. As data from each 

cell, badRowAddress 208 and badColumnID 225 respec- line is loaded from the fuse array 211, the data is presented 

tively are used as separate keys to perform a lookup within at 214 to the other input of RPE multiplexor 300. In addition, 

the Failure Biunap CAM Anay 110. there is a global valid indicator 212 to indicate whether the 

At step 508, it is determined whether a valid entry with a jq fuses have been programmed. The global valid indicator 

matching address has been found. If such an entry exists, the 212, by controlling the multiplexor 300, selects one of the 

hit signals Rxhit and Cxhit, where x is the corresponding two inputs: the fuse array line 214 if die fuse array infor- 

entry number, 0 or 1, are asserted to indicate the match (step mation is valid, and the FBM information 130 otherwise. 

510). If no matching valid entry is found in the Failure The selected information 301 is then split up. Row repair 

Bitmap CAMArray 110, then, in step 512, all four hit signals information is routed to RPE row repair logic 320, and 

are set to 0. column information is routed to RPE column repair logic 

In step 514, the valid flags VO, VI, the row and column 322. The output 303 of the RPE column repair logic 322 is 

Must flags and hit flags, which constitute signals 124 of FIG. passed to a decoder 306 which decodes the column slice 

5, and the Multibad signal 227, are all presented to the address to a word whose length equals the number of column 
Bitmap Update Logic 112 which implements the logic of slices, wherein all bits are 0 except the bit corresponding to 

Table 1, to determine how the Failure Bitmap CAM Array *o column slice to be replaced. 

110 will be updated. The decoder output 305 and RPE row repair logic output 

Table 1 is the tmth table for the Bitmap Update Logic 112. 321 are brought to multiplexor 307 which selects one of the 

The inputs 124, 104 to the Bitmap Update Logic 112 are data signals 305, 321 and serially passes the information 

indicated on the left side of Table 1 under the heading over serial line 308. A column shift clock signal 337 

"Present State," The outputs 126 from the Bitmap Update 65 produced by the decoder 306 controls the selection of 

Logic 112 back to the CAM array 110 are indicated on the multiplexor 307. In addition, the column shift clock signal 

right side of Table 1 under the heading "Next State." 337 and the row shift clock signal 347 generated by the row 
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repair logic 320 are brought to the shift clock control circuit bitmap, the preferred embodiment always finds it. The 

346 which combines the signals into a single clock signal storage requirement for the failure bitmap is minimal. The 

348. This clock signal 348 is used to shift the serial infor- repair analysis is performed on the fly with simple logic, 

mation 308 into the column repair register 302 and the row Furthermore, the row-column repair scheme of the pre- 
repair register 304. both of which form part of the Row- 5 ferred embodiment, combined with RAM segmenting 

Column Repair Register/Logic 257 of FIG. 2. The row repair affords much greater freedom in spare allocation, and there- 

information in the row repair register 304 is now decoded by fore provides higher yield improvement than schemes that 

decoder 95, which is the same decoder used to decode employ just the row-only or column-only repair, 

memory addresses 94. preferred embodiment's strategy of sequential testing 

In the preferred embodiment, there are 83 column slices, and repairing of one RAM segment at a time allows almost 

identified by seven bits. Since seven bits can identify 2''-128 all of the test and repair hardware to be shared among the 

possible column slices, obviously not all of the possible segments, thus keeping the overall hardware overhead low, 

values are used. In fact, for various hardware reasons, it is The only dedicated resources required in a RAM segment 

beneficial to use encoded identifiers which are not all are the spares, the local row*column repair registers and the 

contiguous. See, for example, U.S. application Scr. No. associated repair logic that disables defective row/column 

09/044,275, filed Mar. 19, 1998, assigned to Digital Equip- and enables the spares. This minimizes the impact of the test 

ment Corporation, and incorporated herein by reference. and repair logic on the chip yield. 

FIG. 8 shows details of the decoder 306 of FIG. 7 which The partitioning of the test and repair control logic allows 

compensate for the fact that the column slice identifiers are the control logic to be placed at a convenient location. Only 

not contiguous. The coded column slice address 303 is the most essential logic need be located with the target RAM 

presented to one input of a comparator 332. The output 333 array. 

of a 7-bit counter 330 is presented to the other input of the FinaUy, the preferred embodiment unifies self-repair dur- 

comparator 332. jng manufacturing and fuse-programmed permanent repair 

ThQ counter 330 increments by one upon receiving a for field operation. The features of the Repair Engine allow 

clock signal 331, covering the full range of 0 to 127. When all fuses to be located remotely at a convenient site on chip, 
the counter's value 333 is the same as the column slice ^5 ^jj^j^ jj^-g invention has been particularly shown and 

identifier 303, the comparator outputs a 1. Otherwise, the described with references to preferred embodiments thereof, 

comparator 332 outputs a 0. The comparator's output 305 is it will be understood by those skilled in the art that various 

then fed to multiplexor 307 of FIG. 7. changes in form and details may be made therein without 

The counter's value 333 is at the same time checked by a departing from the spirit and scope of the invention as 

validity check circuit 334 to see if it is a valid encoded defined by the appended claims. 

column slice identifier. Validity may be determined, among For example, the spare column slice need not necessarily 

other methods, by comparing against a list of valid identi- have the same width as the column slices. In other words, 

ficrs. In the preferred embodiment, validity is determined by individual spare columns, or groups of columns, could serve 

counting the number of Is in the counter value. If the to substitute for faulty individual columns or groups of 
number of Is is a pre-determined value, the counter value 35 columns, respectively. In this case, the column slice identi- 

corresponds to a valid column identifier. fier would become a column or column group identifier. The 

Validity check circuit 334 output signal 337 serves as a term column slice address, whether encoded or not, has been 

column shift clock and also provides the multiplexor 307 intended to cover all of these concepts, 

control, as shown in FIG. 7. Where the counter value 333 Of course, the present invention could be equally applied 
corresponds to a valid output slice identifier, the output 337 40 to groups of rows as well as to groups, or slices, of coliunns. 

provides a clock pulse on signal 337 and output 305 is In addition, while the described embodiment comprises 

selected by multiplexor 307. Where the counter valid does eighty-three column slices, each eight bits wide, and 128 

not correspond to a valid output slice identifier, no cbck rows, it will be obvious to one skilled in the art that these are 

signal 337 is produced and the multiplexor 307 selects row exemplary numbers and that other quantities and widths are 
shift data 321. 45 within the scope of the present invention. 

The preferred embodiment's use of a single pair of spare Similarly, while Tables 1 and 2 describe a specific 

row and column slice drastically reduces the complexity of embodiment, it will be obvious to one skilled in the art that 

the update logic required for BiSR. The repair algorithm is other logic which performs essentially the same function 

perfect. That is, if a repair solution exists for a given failure falls within the scope of the present invention. 



TABLE 1 



Present State Next State 
(227, 124) £126J 
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UUUUIIIIT UUUUEAA 
SSSSTTTTY SSSSFTT 
TTTT TTTT 01 



No valid entries: 



0OXXXX0XXXOO00XXO3 0 
0OXXXXX0XXOO0OXXO3 0 
OOXXXXOXXXIO 3 OXXO 3 0 
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TABLE 1-continued 
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(227. 124^ fl26) 
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Both Entries are valid. rOmust and clmust are set: 
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TABLE 1-coDtinued 
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What is claimed is: 

1, A self-repair method for a random access memory 60 
(RAM) array on a chip, the array having a plurality of 
memory cells addressed by their respective columns and 
rows, the method comprising: 
providing at least one spare row of memory cells and at 

least one spare column of memory cells; 65 
providing a failure bitmap having plural entries, each 
entry comprising 



a row address field and a column address field for 
storing a row address and a column address of a 
detected faulty cell, 
a row substitute indicator, and 
a column substitute indicator; 
detecting a faulty memory cell whose row/column address 
matches a row/column address stored in a failure 
bitmap entry, thus indicating that there are plural faulty 
memory cells within the addressed row/column, and 
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setting the respective row/cx)lumn substitute indicator 
of the matching entry to indicate that the spare row/ 
column must be used to substitute for the row/column 
containing the faulty memory cell; 

detecting a faulty memory cell whose row/column address 5 
does not match any row/column address stored in the 
failure bitmap, and storing the row and column 
addresses of the detected faulty memory cell in row and 
column address fields of an entry that does not yet hold 
valid content; and lo 

after detecting all faulty memory cells, allocating the 
spare rows and spare columns, based on the row and 
column address fields and row/column indicator set- 
tings of the failure bitmap; 

each of the above steps being performed by logic resident 
on the chip. 

2. The method of claim 1, wherein detecting faulty 
memory cells comprises: 

writing a value to the memory array; 
reading a value from the memory array; and 
comparing the read and write values to identify faulty 
memory cells in the memory array. 

3. The method of claim 1 wherein a column width 
comprises a plurality of bits. 25 

4. The method of claim 1, wherein allocating spare rows 
comprises: 

allocating spare rows and spare columns to rows and 
columns respectively that have been indicated by the 
substitute indicators as requiring replacement; and 30 

allocating remaining spare rows and spare columns to 
remaining rows and columns whose row and colmnn 
addresses respectively have been stored. 

5. The method of claim 1, the chip containing core logic, 
further comprising: 35 

after repairing the RAM array, using the RAM array to 
test the core logic. 

6. The method of claim 4, further comprising: 
flagging the RAM as unrepairable if the RAM cannot be 

repaired by the spare rows or spare columns. 

7. A random access memory array (RAM), comprising: 
a plurality of memory cells addressed by their respective 

columns and rows; 
at least one spare row of memory cells; 
at least one spare column of memory cells; 
a test and repair logic circuit for locating faulty cells 

within the memory array; 
a failure bitmap having at least two entries, each entry 
comprising 50 
a row address field and a column address field for 
storing a row address and a column address of a 
faulty cell detected by the test and repair logic 
circuit, 

a row substitute indicator and a column substitute 55 
indicator, the test and repair logic, upon detection of 
a faulty memory cell, 

setting, if the detected faulty memory cell's row/ 
column address matches a row/column address 
stored in the failure bitmap, thus indicating that 60 
there are plural faulty memory cells within the 
addressed row/column, the respective row/column 
substitute indicator of the failure bitmap entry 
having the matching row/column address to indi- 
cate that the spare row/column must be used to 65 
substitute for the row/column having the matching 
row/column address, and 



40 



45 



setting, if no match occurs, the row and column 
addresses of the detected faulty memory cell in the 
row and column address fields of a failure bitmap 
entry that does not yet hold valid content; and 
spare allocation logic circuit which, after all faulty 
memory cells have been detected, allocates the spare 
rows and spare columns, based on the row and column 
address fields and row/column indicator settings of the 
failure bitmap. 

8. The RAM of claim 7, wherein each column has a width 
comprising a plurality of bits. 

9. The RAM of claim 8 wherein a column address is a 
unique identifier which identifies a column. 

10. A random access memory array (RAM), comprising: 
a plurality of memory cells addressed by their respective 

columns and rows; 
at least one spare row of memory cells; 
at least one spare column of memory cells; 
a test and repair logic circuit for locating faulty cells 

within the memory array; 
a failure bitmap having at least two entries, each entry 

comprising 

a row address field and a column address field for 
storing a row address and a column address of a 
faulty cell detected by the test and repair logic 
circuit, 

an Rmust indication for indicating that a row refer- 
enced in the row address field must be replaced with 
a spare row, and 

a Cmusi indication for indicating that a column refer- 
enced in the column address field must be replaced 
with a spare column; 
spare allocation logic circuit which uses contents from the 

failure bitmap to allocate the spare rows and spare 

columns; and 

an array of fuses, wherein allocation information is 
burned into the fuses, and wherein the allocation infor- 
mation is read from the fuse array upon startup. 

11. A random access memory array (RAM), comprising: 
a plurality of memory cells addressed by then respective 

columns and rowi^ 
at least one spare row of memory cells; 
at least one spare column of memory cells; 
a test and repair logic circuit for locating faulty cells 

within the memory array; 
a failure bitmap having at least two entries, each entry 

comprising 

a row address field and a column address field for 
storing a row address and a column address of a 
faulty cell detected by the test and repair logic 
circuit, 

a first indicator for indicating that a row referenced in 
the row address field must be replaced with a spare 
row, and 

a second indicator for indicating that a column refer- 
enced in the column address field must be replaced 
with a spare column; 
spare allocation logic circuit allocates the spare rows and 
spare columns, based on the row and column address 
fields and on the first and second indicators; and 
an allocation map, wherein the test and repair logic circuit 
executes a dynamic test at start up, and stores row and 
column allocations in the allocation map. 

12. The RAM of claim 11, further comprising an array of 
fuses, wherein allocation information is burned into the 
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fuses, and wherein it is read from the fuse array upon startup 
if the fuse array allocation information is valid, and is read 
from the failure bitmap if the fuse array aUocation informa- 
tion is not valid. 

13. A computer system comprising a chip, the chip 5 
comprising: 

a random access memory (RAM) array, the RAM com- 
prising 

a plurality of memory cells addressed according to their 
respective columns and rows, 

at least one spare row of memory cells, and 

at least one spare column of memory cells; 
a test and repair logic circuit for locating faulty cells 

within the memory array; 
a failure bitmap having at least two entries, each entry 

comprising 

a row address field and a column address field for 
storing a row address and a column address of a first 
faulty cell detected by the test and repair logic 
circuit, 20 
a first indicator for indicating, if the test and repair 
circuitry detects a second faulty cell having a row 
address that matches the row address stored in the 
row address field, that a row referenced in the row 
address field must be replaced with a spare row, and 
a second indicator for indicating, if the lest and repair 
circuitry detects a second faulty cell having a column 
address that matches the column address stored in 
the column address field, that a column referenced in 
the column address field must be replaced with a 
spare column; and 
spare allocation logic circuit which, after testing of the 
RAM, allocates the spare rows and spare columns 
based on the row and column address fields and the first 
and second indicators. 35 

14. The computer system of claim 13, further comprising 
an allocation map, wherein the test and repair logic circuit 
executes a dynamic test at start up, and stores row and 
column allocations in the allocation map. 

15. The computer system of claim 13, the d)ip further 40 
comprising core logic. 

16. A computer system, comprising: 

a random access memory array (RAM) which further 
comprises; 

a plurality of memory cells addressed by their respec- 45 

tive columns and rows; 
at least one spare row of memory cells; 
at least one spare column of memory cells; 
a repair logic circuit for locating faulty cells within the 

memory array; 50 
a failure bitmap having at least two entries, each entry 

comprising 

a row address field and a column address field for 
storing a row address and a column address of a 
faulty cell detected by the repair logic circuit, 55 
a first indicator for indicating that a row referenced 
in the row address field must be replaced with a 
spare row, and 
a second indicator for indicating that a column 
referenced in the column address field must be go 
replaced with a spare column; 
spare allocation logic circuit which allocates the spare 
rows and spare columns based on the row and column 
address fields and the first and second indicators; 
an allocation map, wherein the test and repair logic circuit 65 
executes a dynamic test at start up, and stores row and 
column allocations in the allocation map; and 



an array of fuses, wherein allocation information is 
burned into the fuses, and wherein the allocation infor- 
mation is read from the fuse array upon startup if the 
fuse array allocation information is valid, and is read 
from the allocation map if the fuse array allocation 
information is not valid. 

17. A semiconductor chip, comprising: 

a random access memory (RAM) array means having a 
plurality of memory cells addressed according to their 
respective columns and rows, comprising: 
row replacement means for replacing a row containing 

at least one faulty memory cell; 
colimm replacement means for replacing a colimin 

containing at least one faulty memory cell; 
failure bitmap means comprising plural entries, each 

entry comprising 

storage means for storing a row address and a 

column address of a detected faulty cell, 
row substitute indicator means, and 
column substitute indicator means; 
means for detecting a faulty memory cell whose row/ 
coltunn address matches a row/column address stored 
in a failure bitmap means entry, and for sening, upon 
said detecting, the respective row/column substitute 
indicator means of said matching entry, and 
means for detecting a faulty memory cell whose row/ 
column address does not match any row/column 
address stored in the failure bitmap means, and for 
storing the row and column addresses of said faulty 
memory cell in said storage means of an entry that does 
not yet hold valid content; and 
means for allocating, after testing of the RAM, the spare 
row/column replacement means based on the stored 
row and column addresses and row/column substitute 
indicator means settings of the failure bitmap means. 

18. The semiconductor chip of claim 17, further compris- 
ing: 

core logic means. 

19. A random access memory array (RAM), comprising: 
a plurality of memory cells addressed by their respective 

columns and rows; 

a test and repair logic circuit for locating faulty cells 
within the memory array; 

at least one spare column of memory cells; 

spare allocation logic circuit which allocates the at least 
one spare column to replace a column with a faulty cell; 

a column identifier encoder for encoding a column iden- 
tifier of a column having a faulty cell; 

storage means for storing an encoded column identifier; 
and 

a decoder for decoding an encoded identifier, the decoder 
further comprising: 

a counter which cycles through at least all allowed 
encoded identifier values, 

a comparator for comparing the counter *s value with a 
stored encoded identifier, the comparator output 
being indicative of a match, and 

a circuit for determining whether the counter's value is 
a valid encoded identifier, and for providing an 
indication thereof, such that when an encoded iden- 
tifier is indicated as valid, the comparator output is 
sent to the spare allocation logic circuit, and the at 
least one spare column allocated accordingly. 

20. The RAM of claim 19, further comprising: 
at least one spare row of memory cells; and 
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a failure bitmap having at least two entries, each entry 
comprising 

a row address field and a column address field for 
storing a row address and an encoded column 
address of a faulty cell detected by the test and repair 5 
logic circuit, 

an Rmust flag for indicating that a row indicated in the 
row address field must be replaced with a spare row, 
and 

a Cmust bit for indicating that a column indicated in the lO 
column address field must be replaced with a spare 
column. 

21. A self -repair method for a random access memory 
(RAM) having a plurality of memory cells addressed by 
their respective columns and rows, the method comprising: 15 
testing the RAM by writing and reading values to a 
sequence of addresses according to a testing algorithm, 
and comparing read values with expected values; 
upon a mismatch, asserting an error signal and providing 
column and row addresses of a failed cell producing the 
mismatch; 

upon a multiple mismatch, asserting a multibad signal and 
providing a row/column address common to failed cells 
producing the mismatches; ^5 

updating a failure bit map upon each assertion of the error 
signal, responsive to the multibad signal, the provided 
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addresses and a current state of the failure bit map to 
indicate results of the testing thus far, the failure bit 
map indicating when a row/column must be replaced 
with a spare row/column due to multiple faulty cells in 
a particular row/column; and 
allocating spare rows and columns responsive to the state 
of the failure bit map upon the completion of testing. 

22. The method of claim 21, wherein the memory com- 
prises plural segments which are independently repairable. 

23. The method of claim 21, wherein testing and repairing 
are done dynamically upon a command. 

24. The method of claim 21, wherein testing and repairing 
are done dynamically upon providing power to the RAM. 

25. The method of claim 21, wherein the RAM is embed- 
ded onto a microprocessor chip. 

26. The method of claim 22, wherein test and repair 
hardware are shared among the plural segments. 

27. The method of claim 22, wherein the failure bitmap is 
shared among the plural segments. 

28. The method of claim 25, wherein the RAM is an 
on-chip cache. 

29. The method of claim 28, wherein after allocating spare 
columns and rows, other functions on the chip are tested. 

* « * * * 
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